\ 


Historic,  Archive  Document 

Do  not  assume  content  reflects  current 
scientific  knowledge,  policies,  or  practices. 


f5  A  Com 


Sorting 
on  Fuel  Depths 

Frank  A.  Albini 


1 


CO 

%4 


USDA  Forest  Service 

General  Technical  Report  INT 23,  1975 

INTERMOUNTAIN  FOREST  AND  RANGE 

EXPERIMENT  STATION 

Ogden,  Utah  84401 


THE  AUTHOR 


FRANK  A.  ALBINI  is  a  Research  Mechanical  Engineer  in  the  Fire  Fundamentals 
research  work  unit  at  the  Northern  Forest  Fire  Laboratory  in  Missoula, 
Montana.  He  received  a  B.S.  in  1958,  an  M.S.  in  1959,  and  a  Ph.D.  in  1962 
in  Aeronautical  and  Mechanical  Engineering,  with  a  Philosophy  minor,  from 
the  California  Institute  of  Technology.  From  1960  to  1963,  he  worked  at 
Hughes  Aircraft  Company  as  a  member  of  a  small  research  staff  in  the  En- 
gineering Laboratories  of  the  Space  Systems  Division.  From  1963  to  1966, 
and  again  from  1971  to  1973,  he  worked  as  a  member  of  the  research  staff  at 
the  nonprofit  Institute  for  Defense  Analyses,  performing  engineering  evalua- 
tions and  systems  analyses  for  the  Department  of  Defense.  From  1966  to 
1971,  he  was  with  the  General  Research  Corporation  in  Santa  Barbara  as  a 
member  of  the  technical  staff  and  as  Director  of  the  Defensive  Systems  De- 
partment. In  1973  he  joined  the  staff  at  the  Northern  Forest  Fire  Laboratory 
to  devote  full  time  to  what  had  been  a  continuing  avocation — research  on 
free -burning  fire. 


USDA  Forest  Service 
General  Technical  Report  INT-23 

July  1975 


A  COMPUTER  ALGORITHM  FOR  SORTING 
FIELD  DATA  ON  FUEL  DEPTHS 


Frank  A.  Albini 


INTERMOUNTAIN  FOREST  AND  RANGE  EXPERIMENT  STATION 

Forest  Service 
U.S.  Department  of  Agriculture 
Ogden,  Utah  84401 
Roger  R.  Bay,  Director 


CONTENTS 


Page 


INTRODUCTION   1 

METHOD    3 

Equations   6 

A.  Bin  Width  Determination   6 

B.  Numerical  Rating  of  Histograms   7 

C.  Sequence  of  Operations   9 

Test  Cases   9 

Field  Data   11 

Program  Available   14 

PUBLICATIONS  CITED   14 

APPENDIX   15 


ABSTRACT 

Describes  an  algorithm  for  separating  fuel  depth  data  into 
distinctive  groups  of  measurements.  The  algorithm  is  justi- 
fied heuristically,  through  its  mathematical  similarity  to  the 
"tuner"  of  a  radio  receiver.  Test  data  generated  artificially 
were  used  to  maximize  the  sensitivity  of  the  algorithm.  The 
algorithm  has  been  applied  successfully  to  field  data  gathered 
by  standard  fuel  inventory  procedures,  but  output  should  be 
verified  by  someone  familiar  with  the  sampled  area.  A  rough 
flow  chart  and  a  FORTRAN  listing  are  included.  Copies  of  the 
computer  program  are  available  from  the  author. 


INTRODUCTION 


Several  published  methods  for  inventorying  wildland  fuels  (Van  Wagner  1968;  Broun 
1971;  USDA  Forest  Service  1974)  and  another  in  preparation1  entail  the  sampling  of  fuel 
depths.    The  vertical  height  to  the  uppermost  fuel  component  is  recorded  at  each  of 
many  sample  points.    When  the  net  of  sample  points  is  sufficiently  dense,  this  procedure 
should  approximate  the  "sheet  depth"  estimation  procedure  (Brown  1971);  but  in  some 
cases  the  captured  information  would  prohibit  the  interpretation  of  it  as  such  without 
special  effort. 

To  illustrate,  consider  the  following  conceptual  arrangement  of  fuel  pieces: 
Dowels,  1  cm  in  diameter,  extending  vertically  1  m  above  the  terrain  surface,  are 
placed  20  cm  apart  in  a  uniform  square  grid.     Using  the  "sheet  depth"  estimation  pro- 
cedure, a  1-m  average  fuel  depth  would  be  deduced.     If  a  sampling  technique  were  used, 
with  the  sample  locations  chosen  at  random,  the  resulting  average  depth  would  depend 
upon  the  details  of  the  sampling  procedure. 

One  common  method  of  depth  sampling  is  the  random  selection  of  points  at  which 
samples  are  to  be  made.     Then  a  5-cm  radius  circle  is  lowered  vertically,  with  the 
plane  of  the  circle  held  horizontally,  until  the  circular  area  intercepts  a  fuel  element. 
In  our  example,  the  intercept  would  occur  at  a  height  of  1  m  if  the  center line  of  a 
dowel  fell  within  5.5  cm  of  the  center  of  the  sampling  circle.     In  other  words,  areas 
of  tt(5.5)2  =  95  cm2  are  selected  at  random  from  the  general  area.     If  a  dowel  center- 
line  lies  within  that  area,  a  depth  of  1  m  is  recorded;  if  not,  a  depth  of  zero  is 
recorded.     Since  the  density  of  dowel  centerlines  in  the  area  is  one  per  400  cm2,  the 
probability  of  intercepting  a  centerline  on  any  given  measurement  is  95/400,  or  0.238. 
So,  for  a  large  enough  set  of  sample  points,  the  recorded  fuel  depths  would  be  23.8 
percent  at  1  m  and  76.2  percent  at  zero,  and  an  average  depth  of  23.8  cm  would  be 
computed . 


iFrandsen,  William  H.     A  firespread  model  for  spatially  nonuniform  forest  floor 
fuel  arrays.     Problem  analysis,  USDA  For.  Serv.,  Intermt.  For.  and  Range  Exp.  Stn., 
North.  For.  Fire  Lab.,  Missoula,  Mont,   (in  preparation). 
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Figure  1. — Histogram  of  hypothetical  depth-measurement  data  showing  three 
superposed  but  distinctive  groupings  of  depths. 

We  could  have  specified,  for  instance,  that  the  spread  of  depths  within  which  a 
measurement  must  fall  to  be  included  in  a  given  bin  should  vary  linearly  with  the 
median  depth  measurement  for  each  bin.     For  example,  we  could  increase  by  0.1  cm  the 
spread  of  depths  for  each  successive  bin,  starting  with  the  smallest,  and  construct  a 
sequence  of  partition  points  as  follows:  1.1,  2.3,  3.6,  5.0,  6.5,  etc.     This  would  lead 
to  a  different  histogram  (fig.  2). 

The  procedure  just  outlined  is  somewhat  arbitrary,  but  has  a  least  heuristic 
justification.     A  measurement  resulting  in  a  recorded  fuel  depth  of  2  cm  is  unlikely  to 
change  by  1  cm  if  repeated.     But  a  measurement  of  30  cm  could  easily  become  29  or  31  cm 
if  repeated.    Not  that  the  measurements  would  necessarily  be  in  error,  but  that  minor 
variations  in  windspeed,  angle  of  solar  illumination,  etc.,  can  result  in  small  physical 
displacements  of  the  fuel  elements,  and  thus  change  the  depth  measurements.     The  greater 
the  fuel  depth,  the  greater  the  latitude  for  variations  in  depth,  and  the  less  important 
are  such  variations  in  determining  fire  behavior. 

Two  significant  features  of  the  revised  histogram  are  that  some  of  the  vertical 
bars  have  become  longer  and  that  the  patterns  are  almost  fully  contiguous  within  each 
group.    My  subjective  view  is  that  the  lack  of  gaps  between  the  vertical  bars  is  an 
important  factor  in  recognizing  groupings.     The  vertical  height  of  prominent  bars  also 
seems  to  attract  the  eye  and  to  lead  one  to  recognize  the  group  with  the  longer  bars 
more  readily  than  the  others.     It  is  these  two  features  which  have  been  used  as  key 
elements  in  the  algorithm  described  below,  permitting  a  digital  computer  to  sort  raw 
data- -a  set  of  depth  measurements — into  groups. 
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Figure  2. — Reconstruction 
of  histogram  of  figure  1 
using  variable  depth 
spreads  for  combining 
measurements . 


Bin  number 


If  the  forms  of  the  distribution  functions  of  the  various  groups  of  data  are  known, 
a  much  more  sophisticated  (and  probably  more  accurate)  method  of  extracting  the  groups 
could  be  used.     This  would  consist  of  the  determination  of  the  Fourier  coefficients  for 
a  series  of  distribution  functions  whose  sum  represents  the  histogram.  Without 
knowledge  of  even  the  form  of  the  distribution  functions,  we  are  forced  to  turn  to 
cruder  means  of  analysis. 

The  approach  that  was  taken  in  formulating  the  algorithm  can  be  described  in  the 
following  terms: 

1.  A  set  of  histograms  is  constructed,  using  different  numbers  of  bins,  with  bin 
widths  varying  from  essentially  constant  to  rapidly  increasing.     The  variation  in  bin 
widths  is  linear  in  the  sense  that  the  difference  between  widths  of  adjacent  bins 
increases  linearly  with  the  ordinal  numbers  of  the  bins  (as  in  the  example  above). 

2.  Each  histogram  is  examined  to  determine  the  most  prominent  group  represented 
on  it.     This  is  done  by  a  thresholding  procedure,  analogous  to  laying  a  piece  of  paper 
over  the  bottom  of  the  histogram,  with  a  straight  edge  horizontal,  and  sliding  it  up 
the  page  until  only  one  set  of  contiguous  vertical  bars  project  above  the  edge.  On 
moving  the  paper  downward,  the  lower  the  edge  of  the  masking  sheet  can  be  brought  be- 
fore a  disjoint  vertical  bar  appears  above  the  edge,  the  more  prominent  is  the  visible 
group . 

3.  A  numerical  rating  for  each  histogram  is  computed.     This  rating  scores  the 
histogram  on  the  basis  of  the  degree  of  prominence  of  the  prominent  group  and  the 
compactness  of  the  distribution  included  in  the  prominent  group.     This  parameter  is 
described  below. 

4.  The  membership  in  the  most  prominent  group  is  determined.     These  measurements 
are  averaged,  then  deleted  from  the  measurement  set.     If  sufficient  data  remain,  the 
entire  process  is  repeated  to  determine  the  next  group. 

The  algorithm,  as  currently  implemented,  is  repeated  to  determine  three  groups. 
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Equations 


The  equations  used  in  implementing  the  steps  outlined  above  are  briefly  developed 
below.     Clearly,  a  major  effort  of  the  algorithm  implementation  was  development  of  the 
logical  framework  and  the  bookkeeping  of  data  manipulation.     But,  these  are  not  germane 
to  the  analytical  content  and  will  not  be  discussed  here. 

It  is  assumed  here  that  the  data  are  all  integers.     If  the  data  were  not  in  integer 
format,  the  values  could  be  multiplied  by  the  appropriate  power  of  10  to  make  them  in- 
tegers.    The  maximum  value  of  200  used  here  is  arbitrary,  but  represents  a  reasonable 
upper  limit  to  ground  fuel  depth  in  centimeters. 

A.     Bin  Width  Determination 

If  the  largest  (integer)  value  of  the  depth-measurement  data  set  is  H,  the  smallest 

the  number  of  bins  into  which  the  data  are  to  be  partitioned  is  N,  and  width  of 
.th 

the  ,7      bin  is  W.,  then 

3 

N 

I     W  .  =  {E  -  S)  (1) 
J  =  l  J 

Since  it  is  desired  that  the  width  of  each  successive  bin  be  greater  by  a  constant 
(a)  than  the  preceding  one,  we  have  also 

W.  =  W.  ,  +  a,  or  Wi  -  W.  .  =  a  (2) 
3        J-l  J  J-l 

Summing  equation  (2)  from  j=2  to  j=J  gives  the  general  form: 

W    =  W    +  (J-  -  l)a  (3) 
J  1 

Employing  the  closure  constraint  (equation  1)  determines  the  value  of  the  width- increase 
parameter,   {a3  as  a  function  of  W      the  first  bin  width,  and  the  other  constants  of  the 
problem: 

N=l 

NW    +  al    3  =  H  -  S;    a  =  (H  -  S  -  NW  )/  (N(N  -  l)/2)  (4) 
3  =  0 


Note  that  when 

W1  =  (H  -  S)/N  (5) 

the  widths  remain  constant.     This  is  one  case  that  is  considered  for  each  value  of  N. 
The  values  of  N  and        are  selected  so  that  a  is  nonnegative. 

Values  of  N  considered  are  incremented  downward  from  Max  (A?)  ,  where 

MaxOV)  =  Min(200,  [H  -  S])  (6) 
in  steps  of  AN.     This  value  is  clearly  arbitrary,  but  by  experimentation,  it  was  found 
that 

LN  =  Max(l,  0.04  Max(iV))  (7) 
works  adequately. 


—    All  zero  measurements  are  purged  at  the  outset. 
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Values  of         considered  are  incremented  upward  from  Min(Jtf),  where 
Min(J7  )  =  (ff  -  S)/N  (8) 
in  arbitrary  small  steps  of  A 17  .     The  value    used  was  decided  by  experimentation: 
Af7x  =  0.075  MinC^)  .  (9) 

In  its  current  configuration,  the  algorithm  employs  22  values  of  N  and  22  values 
of  W  ,  generating  484  histograms  each  time  a  data  set  is  fed  to  the  algorithm.  There 

is  a  tradeoff  between  thoroughness  and  cost   (running  time) .     These  numbers  may  in  fact 
be  larger  than  necessary. 

B.     Numerical  Rating  of  Histograms 

We  desire  to  establish  a  numerical  rating  scheme  that  will  establish  a  ranking  of 
the  various  histograms  according  to  how  well  they  exhibit  the  existence  of  a  prominent 
grouping  of  measurements.     From  general  considerations,  this  scheme  should  have  the 
following  features: 

1.  All  other  factors  being  equal,  the  greater  the  fraction  of  the  total  number 
of  data  points  in  the  histogram  that  fall   into  the  prominent  group  (a  group  must  have 
contiguous  vertical  bars  on  a  histogram  presentat ion--the  prominent  group  has  the  tall- 
est vertical  bar),  the  greater  should  be  the  score. 

2.  As  a  means  of  establishing  the  relative  degree  of  prominence  of  the  prominent 
groups  on  the  different  histograms,  the  score  of  each  should  be  reduced  by  the  amount 
of  data  hidden  below  the  threshold  established  by  the  second-most-prominent  group  on 
each  histogram.     So  the  number  of  bins  contiguous  in  the  prominent  group,  multiplied  by 
the  value  of  the  threshold  setting  (expressed  as  a  fraction  of  total  data  points)  should 
be  deducted.     This  would  leave  the  exposed  fraction  as  the  score. 

3.  To  reduce  the  tendency  of  such  a  scoring  scheme  as  outlined  in  the  first  two 
steps  to  favor  ever-wider  bin  widths,  some  penalty  must  be  assessed  for  using  wider  and 
wider  bins.     Else,  in  the  limit,  all  data  would  be  lumped  into  a  few  large,  contiguous 
bins  and  the  effort  would  fail  its  purpose.     A  satisfactory  way  to  assess  this  penalty 
is  to  divide  the  score  defined  in  step  No.   2  by  the  square  root  of  the  sum  of  the  bin 
widths  included  in  the  prominent  group. 

Taken  together,  these  three  considerations  lead  to  a  scoring  scheme  that  character- 
izes group  prominence  in  terms  of  tall,  narrow,  contiguous  histogram  bars  that  stand 
farthest  above  the  next-most-prominent  group. 

For  those  familiar  with  the  concepts  and  terminology  of  radio  communications,  the 
analog  of  the  histogram  rating  score  is  a  "signal-to-noise  ratio."     In  this  analogy, 
the  fraction  of  the  total  number  of  depth  measurements  that  lies  in  a  contiguous  group 
projecting  above  the  minimum  threshold  plays  the  part  of  a  voltage  signal.     The  noise 
component  in  the  denominator  is  the  square  root  of  the  total  width  of  the  bins  included 
in  the  above-threshold  span.     Extending  this  analogy  one  step  further,  the  algorithm 
as  a  whole  can  be  viewed  as  a  procedure  for  generating  a  band-pass  filter  bank  that 
maximizes  the  signal-to-noise  ratio  defined  in  this  way. 
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The  equation  used  for  scoring  in  the  algorithm  is 

K  K 


S  =  (Z    n./N    -  (K  -  J  +  l)/)/([l  +  6  +  E    J?  ])*  (10) 

i=J  ^    1  i=J  ^ 

where 

5  =  histogram  rating  score 

n .  =  number  of  data  points  in  bin  number  i 
/Vy  =  total  number  of  data  points  in  histogram 

J    =  smallest  ordinal  number  of  bins  in  contiguous  group,  each  of  whose  data  point 
count  exceeds  the  threshold  number 

K    =  largest  ordinal  number  of  bins  in  contiguous  group,  each  of  whose  data  point 
count  exceeds  the  threshold  number 

f    =  threshold  number  expressed  as  a  fraction  of  total  data  points  in  histogram. 
This  is,  fN    is  the  data  point  count  in  a  disjoint  bin  which  sets  the 
threshold 

W.  =  width  of  bin  number  i 

i 

6  =  0.075  W  ,  merely  a  handy  small  quantity  to  eliminate  spurious  roundoff 

results  when  the  denominator  is  rounded  down  to  the  next  lower  integer  (the 
1  is  added  for  similar  stability  reasons). 

[x]  =  next  smaller  integer  than  x.  Integer  values  are  used  because  a  unit  repre- 
sents the  smallest  possible  change  in  a  data  point  value,  so  the  smallest 
meaningful  change  in  bin  width. 


C .     Sequence  of  Operations 

The  sequence  of  operations  in  the  algorithm  described  is  shown  in  the  flow  diagram 
of  figure  3.  Most  of  the  logical  and  bookkeeping  details  have  been  suppressed  and  only 
the  outline  of  operations  is  depicted. 

Test  Cases 

Several  test  cases  were  used  to  perfect  and  exercise  the  computer  program  depicted 
in  figure  3.     The  purpose  of  these  test  cases  was  to  establish  the  ability  of  the 
program  to  discern  groups  in  artificial  data  generated  by  computer. 

First,  a  background  set  of  data  were  generated  by  selecting  random  integers 
between  0  and  100,  inclusive.     Then  three  additional  sets  of  data  were  generated  by 
selecting  numbers  at  random  from  triangular  distributions.     The  range  of  these  added 
data  points  varied  from  trial  to  trial,  but  typically  there  would  be  one  with  a  mean 
in  the  range  5-10,  one  with  a  mean  in  the  range  15-20,  and  one  in  the  range  60-70. 
The  number  of  points  in  the  background  distribution  was  varied,  as  was  the  number  of 
points  in  each  of  the  triangular  distributions  superposed  on  the  background.  The 
purpose  of  these  trials  was  to  determine  the  point  at  which  the  algorithm  could  no 
longer  discern  the  superposed  triangular-distribution  sets  of  data  as  distinctive 
groups  against  the  noise  background. 
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Figure  4. — Accuracy  of 
estimation  of  distribu- 
tion mean-versus- density 
of  sample  points  from 
triangular  distribution 
superposed  on  a  flat 
distribution  background 
noise. 


The  results  of  these  tests  were  expressed  as  the  percentage  error  in  the  estima- 
tion of  the  means  of  the  superposed  data  groups.     A  parameter  that  characterizes  the 
degree  to  which  the  superposed  distribution  is  immersed  in  the  background  random  data 
is  the  ratio,  R,  where 

i?  =  Density  of  data  points  in  range  of  added  distribution  (11) 
Density  of  data  points  in  the  background  data  set 

The  density  of  data  points  is  here  defined  to  be  the  total  number  of  data  points 
in  a  specified  range  of  values,  divided  by  the  span  of  values  in  that  range.  For 
example,  in  the  case  represented  by: 

1.  One  hundred  random  data  points  in  the  range  0-100  (background) 

2.  Twenty  data  points  added  from  a  triangular  distribution  in  the  range  10-20 

3.  Suppose  seven  of  the  random  data  points  also  fell  in  the  range  10-20; 

the  density  of  data  points  in  the  range  of  the  added  distribution  would  be  (20  +  7)/ 
(20  -  10)  =  2.7.     The  density  of  data  points  in  the  background  data  set  would  be 
(100)/(100  -  0)  =  1.0,  so  R  =  2.7/1.0  =  2.7. 

This  ratio  is  used  as  the  abscissa  of  the  scatter  diagram  plotted  in  figure  4. 
This  figure  shows  all  the  test  case  exercises  of  the  group-finding  algorithm.  The 
algorithm  exhibits  the  tendency  characteristic  of  its  electrical  analog  discussed 
above,  namely  that  it  either  works  very  well  or  fails  dramatically,  and  that  this 
failure  is  to  be  expected  around  a  value  of  5  or  4  for  the  ratio  defined  above. 
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Note  that  in  all  cases  where  the  ratio  R  is  greater  than  4,  the  algorithm  dis- 
covered the  superposed  distributions  and  established  their  mean  values  with  an  error 
less  than  5  percent.     Of  the  seven  cases  shown  with  R  less  than  4,  two  distributions 
were  recovered  and  their  means  established  with  accuracy  better  than  5  percent;  one  was 
recovered  but  the  mean  was  13  percent  in  error;  three  were  not  recovered  accurately, 
although  their  means  were  estimated  (one  with  5  percent  error,  one  with  17  percent 
error,  and  one  with  57  percent);  and  one  distribution  was  not  recovered  at  all. 

Because  the  performance  of  the  algorithm  closely  paralleled  that  which  would  have 
been  expected  on  the  basis  of  the  analogy  that  was  used  in  constructing  it,  no  further 
developmental  work  was  done.     The  procedure  has  been  used  on  actual  field  data  as 
discussed  in  the  following  section. 

Field  Data 

As  part  of  the  White  Cap  Drainage  Wilderness  Fire  Management  Study, 3  fuel  inventory 
data  were  taken  in  1971,  1972,  and  1973.     The  data  were  reduced  to  punched-card  formats, 
categorized  by  timber  cover  type  and  by  ecological   (climax  tree  species/understory 
species)  habitat  type.4 

We  hoped  to  exercise  Rothermel's  spread  rate  model  against  these  data  to  help 
establish  prescription  guidelines  for  fire  management  in  the  study  area.     To  do  so 
required  not  only  fuel  loading  data  but  representative  fuel  depth  estimates.  Because 
the  field  data  did  not  include  the  nature  of  the  fuel  that  resulted  in  the  recorded 
depths,  some  sorting  procedure  was  necessary. 

The  automated  procedure  outlined  above  was  used  with  these  data,  resulting  in  a  set 
of  depth-groups  for  each  habitat  type  and  timber  type.     These  abbreviated  data  were 
then  reviewed  for  consistency  and  to  see  if  the  discovered  distributions  were  really 
representative  of  the  areas  in  question.     Persons  familiar  with  the  general  regions 
under  investigation  could  review  these  output  data  much  more  readily  than  they  could  the 
raw  data. 

In  perhaps  two-thirds  of  the  cases  reviewed,  the  depth-groupings  recovered  by  the 
algorithm  were  acceptable  to  the  reviewers.     In  the  other  cases,  the  consensus  was  that 
the  algorithm  had  failed  to  group  the  data  in  a  meaningful  way. 

Some  outputs  are  obviously  spurious  even  to  someone  unfamiliar  with  the  area  under 
investigation.     An  example  of  such  a  result  occurred  in  the  grand  fir  timber  type  (dead 
fuel)  exercise.     This  data  set  consisted  of  780  data  points,  ranging  from  zero  (40 
points)  to  200  cm,  the  greatest  depth  recorded  (16  points) .     The  algorithm  first 
recovered  a  grouping  of  488  data  points  between  1  and  6  cm  with  an  average  depth  of 
10.46.     This  left  117  data  points  scattered  between  18  and  200  cm,  of  which  the 
obviously  most  prominent  group  was  the  concentration  of  16  data  points  at  200  cm.  This 
last  group  was  clearly  spurious.     On  other  exercises  of  the  algorithm,  more  subtle  but 
still  spurious  results  were  detected. 

In  most  cases  the  algorithm  appeared  to  have  worked  well  in  recovering  one  or  two 
groups  from  the  raw  data.     Frequently  a  third  attempt  yielded  spurious  results;  often 
the  data  were  insufficient  to  substantiate  the  existence  of  a  third  significant 
distinctive  group. 


3Aldrich,  David  F.,  and  Robert  W.  Mutch.     Fire  management  prescriptions:  a  model 
plan  for  wilderness  ecosystems.     USDA  For.  Serv. ,  Intermt.  For.  and  Range  Exp.  Stn. 
(In  preparation.) 

^Pfister,  R.  D.,  B.  L.  Kovalick,  S.  A.  Arno ,  and  R.  C.  Presby.     Forest  habitat 
types  of  Montana.     USDA  For.  Serv.,  Intermt.  For.  and  Range  Exp.  Stn.     (In  preparation.) 
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Figure  5 . --Histogram  of 
dead  fuel  depth  measure- 
ments less  than  50  om 
taken  in  lodgepole  pine 
timber  type  area  of  White 
Cap  Drainage  Wilderness 
Fire  Management  Study. 


Depth  of  fuel  (dead), cm 


An  example  of  such  an  exercise  is  the  lodgepole  pine  timber  type  (dead  fuel  only) 
data  from  the  cited  study.     These  data  consisted  of  396  depth  samples  from  zero  to 
190  cm,  which  included  35  zero-depth  data  points  and  the  erroneous  replication  of 
6  points.     Of  the  361  nonzero  data  points  processed  by  the  algorithm,  all  but  18  were 
measurements  of  less  than  50  cm.     These  18  points  were  widely  scattered- -the  only 
concentrations  being  3  points  at  59-60  cm,  and  3  at  90  cm  (1  a  spurious  replicate) . 
The  less  than  50-cm  data  are  shown  in  the  histogram  of  figure  5. 

These  data  were  processed  by  the  algorithm,  resulting  in  the  tapered  bin  width 
histogram  of  figure  6.     In  this  histogram,  a  prominent  group  is  clearly  evident.  This 
group  has  an  average  depth  of  2.54  cm  and  contains  223  data  points. 

The  histogram  of  the  remaining  138  (120  below  50  cm)  data  points  is  shown  in 
figure  7.     Once  more  these  data  were  processed  by  the  algorithm,  resulting  in  the 
modified  histogram  shown  in  figure  8.     Again,  a  contiguous  group  is  clearly  evident 
in  the  modified  histogram,  which  shows  all  the  data  because  the  scale  is  so  contracted 
here.     The  group  apparent  in  the  latter  histogram  averages  16.04  cm  in  depth  and  con- 
tains 118  data  points. 

The  remaining  20  data  points  were  processed  again,  resulting  in  a  spurious  group 

of  3  data  points  at  90  cm.     This  last  group  was  rejected,  but  the  other  two  groups  were 

found  to  be  acceptable  as  representative  of  needle  litter-mat  and  fallen  dead  woody 
material,  respectively. 
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Figure  6. — Histogram  of 
figure  5  as  recon- 
structed by  group- 
finding  algorithm. 
Prominent  group  in 
first  two  bins 
contains  223  data 
points . 
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Figure  7 . --Histogram  of  figure  5  after  removal  of  1-5  cm  group, 


50 


13 


Figure  8. — Histogram  of 
figure  7  ( extended  to 
include  all  data)  as 
reconstructed  by  group- 
finding  algorithm. 
Prominent  group  in 
first  six  bins  contains 
118  data  points. 


Min.depth>    6  8  12  17   32    52    78    110  148  169 
Max.deprh<  8  12  17  24  42    65    94    128  169  191 
Range  of  fuel  depths  (cm) 
rounded  to  proper  integer  value 


Computer  processing  of  these  data  greatly  expedited  the  chore  of  establishing 
representative  depths  for  fuel  arrays  characteristic  of  various  areas  in  the  study 
region.     Augmented  by  judicious  review  of  the  computer  output,  this  method  may  be  useful 
to  other  researchers  holding  similar  sets  of  data. 


The  computer  program  listing  (FORTRAN  IV)  is  given  in  the  appendix.     Copies  of  the 
program  may  be  obtained  by  contacting  the  author  at  the  following  address: 

USDA  Forest  Service 

Northern  Forest  Fire  Laboratory 

Drawer  G 

Missoula,  MT  59801 


Brown,  James  K. 

1971.  A  planar  intersect  method  for  sampling  fuel  volume  and  surface  area.  For. 
Sci.  17(1) :96-102. 

Rothermel,  Richard  C. 

1972.  A  mathematical  model  for  predicting  fire  spread  in  wildland  fuels.  USDA 
For.  Serv.  Res.  Pap.   INT-115,  40  p. 

USDA  Forest  Service. 

1974.     National  fuel  classification  and  inventory  system:  preliminary  draft.     24  p. 
Division  of  Fire  Management,  Washington,  D.C. 

Van  Wagner,  C.  E. 

1968.     The  line  intersect  method  in  forest  fuel  sampling.     For.  Sci.  14:20-26. 


Program  Available 
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Headquarters  for  the  Inter  mountain  Forest  and 
Range  Experiment  Station  are  in  Ogden,  Utah. 
Field  Research  Work  Units  are  maintained  in: 

Boise,  Idaho 

Bozeman,    Montana  (in  cooperation  with 

Montana  State  University) 
Logan,    Utah  (in  cooperation  with  Utah 

State  University) 
Missoula,    Montana  (in  cooperation  with 

University  of  Montana) 
Moscow,  Idaho  (in  cooperation  with  the 

University  of  Idaho) 
Provo,  Utah  (in  cooperation  with  Brigham 

Young  University) 
Reno,   Nevada  (in  cooperation  with  the 

University  of  Nevada) 
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